Method for identifying lineage-related antibodies

ABSTRACT

In certain embodiments, the method may comprise: a) obtaining the antibody sequences from a population of B cells; b) grouping the antibody sequences to provide a plurality of groups of lineage-related antibodies; c) testing a single antibody from each of the groups in a bioassay and, after the first antibody has been identified, d) testing further antibodies that are in the same group as the first antibody in a second bioassay. In another embodiment, the method may comprise: a) testing a plurality of antibodies obtained from a first portion of an antibody producing organ of an animal; b) obtaining the sequence of a first identified antibody; c) obtaining from a second portion of said antibody producing organ the sequences of further antibodies that are related by lineage to said first antibody; and, c) testing the further antibodies in a second bioassay.

CROSS REFERENCE TO RELATED APPLICATION

This patent application is a continuation of U.S. application Ser. No. 13/748,507, filed on Jan. 23, 2013, granted U.S. Pat. No. 8,969,013, which is a continuation of U.S. application Ser. No. 13/552,517, filed Jul. 18, 2012, granted U.S. Pat. No. 8,617,830, which is a continuation of U.S. application Ser. No. 12/878,925, filed on Sep. 9, 2010, granted U.S. Pat. No. 8,293,483, which claims the priority benefit of U.S. provisional application Ser. No. 61/241,714, filed on Sep. 11, 2009, all of which is are incorporated by reference herein in their entirety.

INTRODUCTION

Antibodies are proteins that bind a specific antigen. Generally, antibodies are specific for their targets, have the ability to mediate immune effector mechanisms, and have a long half-life in serum. Such properties make antibodies powerful therapeutics. Monoclonal antibodies are used therapeutically for the treatment of a variety of conditions including cancer, inflammation, and other diseases. There are currently over two dozen therapeutic antibody products on the market and hundreds in development.

There is a constant need for new antibodies and methods for making the same.

SUMMARY

In certain embodiments, the method may comprise: a) obtaining the antibody heavy chain sequences and the antibody light chain sequences from a population of B cells of an animal, wherein the population of B cells is enriched for B cells that produce antibodies that specifically bind to a target antigen; b) grouping the heavy and light chain sequences on the basis of sequence similarity to provide a plurality of groups of antibodies that are related by lineage; c) testing a single antibody from each of the groups in a first bioassay to identify a first antibody that has a biological activity; and, after the first antibody has been identified, d) testing further antibodies that are in the same group as the first antibody in a second bioassay, thereby identifying a second antibody that has the biological activity.

In other embodiment, the method may comprise: a) testing a plurality of antibodies obtained from a first portion of an antibody producing organ of an animal in a first bioassay to identify a first antibody that has a biological activity; b) obtaining the sequence of the first antibody; c) obtaining from a second portion of said antibody producing organ the heavy and light chain amino acid sequences of further antibodies that are related by lineage to said first antibody by PCR, using probes are designed using the sequence of the first antibody; and, c) testing a plurality of the further antibodies in a second bioassay to identify a second antibody that has said biological activity.

In certain embodiments, the method provides a means by which significant portion of the entire antibody repertoire of an animal can be screened to identify an antibody with desirable properties. In certain embodiments the method involves first identifying a single antibody with desirable properties, and then screening other antibodies in same lineage group (i.e., a clonally related group of antibodies) as the identified antibody, to identify other antibodies that may have even more desirable properties relative to the identified antibody. As such, the method provides an efficient way to screen for and identify new, biologically active antibodies. After identification, the second antibody may be tested in further assays, and, if it is suitable for use as a therapy, may be humanized, for example.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A and 1B schematically illustrate two embodiments.

FIG. 2 shows the amino acid sequences of selected KDR-binding antibodies. Page 1 of FIG. 2 shows amino acid sequences of the heavy chains. Page 2 of FIG. 2 shows amino acid sequences of the corresponding light chains. The amino acid sequences shown in FIG. 2 are of antibodies that specifically bind to KDR and block VEGF activity. From top to bottom, FIG. 2 (page 1 of 2) SEQ ID NOS: 1-47 and FIG. 2 (page 2 of 2) SEQ ID NOS: 48-94.

FIG. 3 shows the amino acid sequence of 20 exemplary VH3 regions of unrelated rabbit antibodies. From top to bottom SEQ ID NOS: 95-114.

FIGS. 4A-4H show exemplary methods by which related antibodies can be amplified.

DEFINITIONS

Before the present subject invention is described further, it is to be understood that this invention is not limited to particular embodiments described, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present invention will be limited only by the appended claims.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods and materials are now described. All publications mentioned herein are incorporated herein by reference to disclose and describe the methods and/or materials in connection with which the publications are cited.

It must be noted that as used herein and in the appended claims, the singular forms “a”, “and”, and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “an antibody” includes a plurality of such antibodies and reference to “a framework region” includes reference to one or more framework regions and equivalents thereof known to those skilled in the art, and so forth.

The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention. Further, the dates of publication provided may be different from the actual publication dates which may need to be independently confirmed.

The term “nucleic acid” encompasses DNA, RNA, single stranded or double stranded and chemical modifications thereof. The terms “nucleic acid” and “polynucleotide” are used interchangeably herein.

The term “expression”, as used herein, refers to the process by which a polypeptide is produced based on the nucleic acid sequence of a gene. The process includes both transcription and translation.

The term “expression cassette” refers to a nucleic acid construct capable of directing the expression of a gene/coding sequence of interest, which is operably linked to a promoter of the expression cassette. Such cassettes can be a linear nucleic acid or can be present in a “vector”, “vector construct”, “expression vector”, or “gene transfer vector”, in order to transfer the expression cassette into target cells. Thus, the term includes cloning and expression vehicles, as well as viral vectors.

The term “operably linked” refers to an arrangement of elements wherein the components so described are configured so as to perform their usual function. Thus, a given signal peptide that is operably linked to a polypeptide directs the secretion of the polypeptide from a cell. In the case of a promoter, a promoter that is operably linked to a coding sequence will direct the expression of the coding sequence. The promoter or other control elements need not be contiguous with the coding sequence, so long as they function to direct the expression thereof. For example, intervening untranslated yet transcribed sequences can be present between the promoter sequence and the coding sequence and the promoter sequence can still be considered “operably linked” to the coding sequence.

The term “plurality” refers to more than 1, for example more than 2, more than about 5, more than about 10, more than about 20, more than about 50, more than about 100, more than about 200, more than about 500, more than about 1000, more than about 2000, more than about 5000, more than about 10,000, more than about 20,000, more than about 50,000, more than about 100,000, usually no more than about 200,000. A “population” contains a plurality of items.

The term “introduced” in the context of inserting a nucleic acid sequence into a cell, means “transfection”, or ‘transformation”, or “transduction” and includes reference to the incorporation of a nucleic acid sequence into a eukaryotic or prokaryotic cell wherein the nucleic acid sequence may be present in the cell transiently or may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid, or mitochondrial DNA), converted into an autonomous replicon.

The terms “antibody” and “immunoglobulin” are used interchangeably herein. These terms are well understood by those in the field, and refer to a protein consisting of one or more polypeptides that specifically binds an antigen. One form of antibody constitutes the basic structural unit of an antibody. This form is a tetramer and consists of two identical pairs of antibody chains, each pair having one light and one heavy chain. In each pair, the light and heavy chain variable regions are together responsible for binding to an antigen, and the constant regions are responsible for the antibody effector functions.

The recognized immunoglobulin polypeptides include the kappa and lambda light chains and the alpha, gamma (IgG₁, IgG₂, IgG₃, IgG₄), delta, epsilon and mu heavy chains or equivalents in other species. Full-length immunoglobulin “light chains” (of about 25 kDa or about 214 amino acids) comprise a variable region of about 110 amino acids at the NH₂-terminus and a kappa or lambda constant region at the COOH-terminus. Full-length immunoglobulin “heavy chains” (of about 50 kDa or about 446 amino acids), similarly comprise a variable region (of about 116 amino acids) and one of the aforementioned heavy chain constant regions, e.g., gamma (of about 330 amino acids).

The terms “antibodies” and “immunoglobulin” include antibodies or immunoglobulins of any isotype, fragments of antibodies which retain specific binding to antigen, including, but not limited to, Fab, Fv, scFv, and Fd fragments, chimeric antibodies, humanized antibodies, single-chain antibodies, and fusion proteins comprising an antigen-binding portion of an antibody and a non-antibody protein. The antibodies may be detectably labeled, e.g., with a radioisotope, an enzyme which generates a detectable product, a fluorescent protein, and the like. The antibodies may be further conjugated to other moieties, such as members of specific binding pairs, e.g., biotin (member of biotin-avidin specific binding pair), and the like. The antibodies may also be bound to a solid support, including, but not limited to, polystyrene plates or beads, and the like. Also encompassed by the term are Fab′, Fv, F(ab′)₂, and or other antibody fragments that retain specific binding to antigen, and monoclonal antibodies.

Antibodies may exist in a variety of other forms including, for example, Fv, Fab, and (Fab′)₂, as well as bi-functional (i.e. bi-specific) hybrid antibodies (e.g., Lanzavecchia et al., Eur. J. Immunol. 17, 105 (1987)) and in single chains (e.g., Huston et al., Proc. Natl. Acad. Sci. U.S.A., 85, 5879-5883 (1988) and Bird et al., Science, 242, 423-426 (1988), which are incorporated herein by reference). (See, generally, Hood et al., “Immunology”, Benjamin, N.Y., 2nd ed., 1984, and Hunkapiller and Hood, Nature, 323, 15-16, 1986).

An immunoglobulin light or heavy chain variable region consists of a framework region (FR) interrupted by three hypervariable regions, also called “complementarity determining regions” or “CDRs”. The extent of the framework region and CDRs have been precisely defined (see, “Sequences of Proteins of Immunological Interest,” E. Kabat et al., U.S. Department of Health and Human Services, 1991). The sequences of the framework regions of different light or heavy chains are relatively conserved within a species. The framework region of an antibody, that is the combined framework regions of the constituent light and heavy chains, serves to position and align the CDRs. The CDRs are primarily responsible for binding to an epitope of an antigen.

The term “chimeric antibodies” refer to antibodies whose light and heavy chain genes have been constructed, typically by genetic engineering, from antibody variable and constant region genes belonging to different species. For example, the variable segments of the genes from a mouse monoclonal antibody may be joined to human constant segments, such as gamma 1 and gamma 3. An example of a therapeutic chimeric antibody is a hybrid protein composed of the variable or antigen-binding domain from a rabbit antibody and the constant or effector domain from a human antibody, although other mammalian species may be used.

The term “humanized antibody” or “humanized immunoglobulin” refers to a non-human (e.g., mouse or rabbit) antibody containing one or more amino acids (in a framework region, a constant region or a CDR, for example) that have been substituted with a correspondingly positioned amino acid from a human antibody. In general, humanized antibodies produce a reduced immune response in a human host, as compared to a non-humanized version of the same antibody.

The terms “polypeptide” and “protein”, used interchangeably herein, refer to a polymeric form of amino acids of any length, which can include coded and non-coded amino acids, chemically or biochemically modified or derivatized amino acids, and polypeptides having modified peptide backbones.

The term “natural” antibody refers to an antibody in which the heavy and light chains of the antibody have been made and paired by the immune system of a multi-cellular organism. Spleen, lymph nodes, bone marrow and serum are examples of tissues that produce natural antibodies. For example, the antibodies produced by the antibody producing cells isolated from a first animal immunized with an antigen are natural antibodies.

The term “non-naturally paired”, with respect to VH and VL chains of an engineered antibody, refers to a VH and VL pair that is not found in a natural antibody. Thus, a non-naturally paired antibody is a combination of VH and VL chain of two different natural antibodies. The VH and VL chains of a non-naturally paired antibody are not mutated relative to the VH and VL chains of the two different antibodies which provided the VH and VL chains. For example, the “non-naturally paired” IgH and IgL chains of the engineered antibody may contain the IgH variable chain from a first antibody producing cell obtained from an animal and the IgL variable chain of second antibody producing cell obtained from the same animal, where the amino acid sequence of the antibody produced by the first cell is different from the amino acid sequence of the antibody produced by the second cell. In this example, the IgH and IgL chains may be from the same lineage group. An antibody containing “non-naturally paired” IgH and IgL chains may or not be made by phage display. As such, antibodies may or may not contain viral (e.g., bacteriophage M13)-derived sequences.

The term “lineage-related antibodies” and “antibodies that related by lineage” as well as grammatically-equivalent variants there of, are antibodies that are produced by cells that share a common B cell ancestor. Related antibodies produced by related antibody producing cells bind to the same epitope of an antigen and are typically very similar in sequence, particularly in their L3 and H3 CDRs. Both the H3 and L3 CDRs of lineage-related antibodies have an identical length and a near identical sequence (i.e., differ by up to 5, i.e., 0, 1, 2, 3, 4 or 5 residues). In certain cases, the B cell ancestor contains a genome having a rearranged light chain VJC region and a rearranged heavy chain VDJC region, and produces an antibody that has not yet undergone affinity maturation. “Naïve” or “virgin” B cells present in spleen tissue, are exemplary B cell common ancestors. Related antibodies are related via a common antibody ancestor, e.g., the antibody produced in the naïve B cell ancestor. The term “related antibodies” is not intended to describe a group of antibodies that are not produced by cells that arise from the same ancestor B-cell. A “lineage group” contains a group of antibodies that are related to one another by lineage.

The terms “treating” or “treatment” of a condition or disease refer to providing a clinical benefit to a subject, and include: (1) preventing at least one symptom of the conditions, i.e., causing a clinical symptom to not significantly develop in a mammal that may be exposed to or predisposed to the disease but does not yet experience or display symptoms of the disease, (2) inhibiting the disease, i.e., arresting or reducing the development of the disease or its symptoms, or (3) relieving the disease, i.e., causing regression of the disease or its clinical symptoms.

DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS

One embodiment of the subject method is illustrated in FIG. 1A. With reference to FIG. 1A, this embodiment of the method may involve immunizing an antibody-producing animal with a selected antigen, and enriching from a larger population of antibody-producing cells that bind to the antigen. In FIG. 1, six different antibody producing cells A-F that produce antibodies that bind to a target antigen are enriched from a larger population of antibody producing cells. However, in many embodiments, there may be several hundred or several thousand enriched cells. Each of these cells produces a natural antibody that contains a naturally paired IgH and IgL chain. The amino acid sequences of the heavy and light chains of the antibodies produced by the enriched cells are obtained by sequencing the nucleic acids encoding the IgH and IgL chains of the antibodies, and the sequences are analyzed and put into lineage groups which, as discussed above, are groups of antibodies that are produced by cells that share a common B cell ancestor. Such antibodies generally have very similar sequences, and have H3 CDRs of identical length and near identical sequence as well as L3 CDRs of identical length and a near identical sequence. In the embodiment shown in FIG. 1A, the six antibody producing cells produce antibodies (AbA to AbF) that are in two lineage groups (i.e., lineage groups 1 and 2, where AbA and AbB are in lineage group 1 and AbC, AbD, AbE and AbF are in lineage group 2). After the antibodies have been placed into lineage groups, a single antibody (or, in certain cases, multiple antibodies (for example 2-10) from each lineage group) from at least one of the lineage group, e.g., AbA from lineage group 1 and AbC from lineage group 2, is selected for testing in a bioassay, where a bioassay identifies an antibody with a biological activity (e.g., a blocking or neutralizing activity). Once an antibody having a biological activity has been identified, e.g., AbC, other antibodies from the same lineage group as the identified antibody are tested to identify a second antibody that has the same biological activity as the first antibody. In the example shown in FIG. 1, antibodies D, E and F, which belong to the same lineage group as antibody C, were tested.

Many warm-blooded animals, in particular mammals such as humans, rabbits, mice, rats, sheep, cows, pigs and ayes such as chickens and turkeys, may be used as a source of antibody-produced cells. However, in certain embodiments a rabbit or mice is used because of their ease in handling, well-defined genetic traits, and the fact that they may be readily sacrificed. Procedures for immunizing animals are well known in the art, and are described in Harlow et al., (Antibodies: A Laboratory Manual, First Edition (1988) Cold Spring Harbor, N.Y.).

Suitable antigens include extracellularly-exposed fragments of Her2, GD2, EGFR, CEA, CD52, CD20, Lym-1, CD6, complement activating receptor (CAR), EGP40, VEGF, tumor-associated glycoprotein TAG-72 AFP (alpha-fetoprotein), BLyS (TNF and APOL—related ligand), CA125 (carcinoma antigen 125), CEA (carcinoembrionic antigen), CD2 (T-cell surface antigen), CD3 (heteromultimer associated with the TCR), CD4, CD11a (integrin alpha-L), CD14 (monocyte differentiation antigen), CD20, CD22 (B-cell receptor), CD23 (low affinity IgE receptor), CD25 (IL-2 receptor alpha chain), CD30 (cytokine receptor), CD33 (myeloid cell surface antigen), CD40 (tumor necrosis factor receptor), CD44v6 (mediates adhesion of leukocytes), CD52 (CAMPATH-1), CD80 (costimulator for CD28 and CTLA-4), complement component C5, CTLA, EGFR, eotaxin (cytokine A11), HER2/neu, HLA-DR, HLA-DR10, HLA Class II, IgE, GPiib/iiia (integrin), Integrin aVβ3, Integrins a4β1 and a4β7, Integrin β2, IFN-gamma, IL-1β, IL-4, IL-5, IL-6R (IL6 receptor), IL-12, IL-15, KDR (VEGFR-2), lewisy, mesothelin, MUC1, MUC18, NCAM (neural cell adhesion molecule), oncofetal fibronectin, PDGFβR (Beta platelet-derived growth factor receptor), PMSA, renal carcinoma antigen G250, RSV, E-Selectin, TGFbeta1, TGFbeta2, TNFalpha, TRAIL-R1, VAP-1 (vascular adhesion protein 1) or TNFα, or the like. In many embodiments, a peptide having the amino acid sequence corresponding to a portion of an extracellular domain of one of the above-listed proteins is employed as an antigen.

Antibody-producing cells may also be obtained from a subject which has generated the cells during the course of a selected disease or condition. For instance, antibody-producing cells from a human with a disease of unknown cause, such as rheumatoid arthritis, may be obtained and used in an effort to identify antibodies which have an effect on the disease process or which may lead to identification of an etiological agent or body component that is involved in the cause of the disease. Similarly, antibody-producing cells may be obtained from subjects with disease due to known etiological agents such as malaria or AIDS. These antibody-producing cells may be derived from the blood, lymph nodes or bone marrow, as well as from other diseased or normal tissues. Antibody-producing cells may also be prepared from blood collected with an anticoagulant such as heparin or EDTA. The antibody-producing cells may be further separated from erythrocytes and polymorphs using standard procedures such as centrifugation with Ficoll-Hypaque (Pharmacia, Uppsula, Sweden). Antibody-producing cells may also be prepared from solid tissues such as lymph nodes or tumors by dissociation with enzymes such as collagenase and trypsin in the presence of EDTA.

In exemplary embodiments, an affinity purification method is utilized to isolate antibody producing cells that produce antibodies that bind to an antigen. The antigen with which the animal was immunized may be immobilized on a solid phase and used to selectively retain antibody producing cells that express an antibody on their surface that binds to the antigen, while other cells are washed away. The retained cells may then be eluted by a variety of methods, such as by using an excess of the antigen, chaotropic agents, changing the pH, salt concentration, etc. Any of the well known methods for immobilizing or coupling antigen to a solid phase may be used. For example, when the antigen is a cancer cell, appropriately treated microtiter plate that will bind to cells may be used, such as microtiter plates for cell culture. In the instances where the antigen is a protein, the protein may be covalently attached to a solid phase, for example, sepharose beads, by well known techniques, etc. Alternatively, a labeled antigen may be used to specifically label cells that express an antibody that binds to the antigen and the labeled cells may then be isolated by cell sorting (e.g., by FACS). In certain cases, methods for antibody purification may be adapted to isolate antibody producing cells. Such methods are well known and are described in, for example, J Immunol Methods. 2003 November; 282(1-2):45-52; J Chromatogr A. 2007 Aug. 10; 1160(1-2):44-55; J Biochem Biophys Methods. 2002 May 31; 51 (3):217-31. Cells may also be isolated using magnetic beads or by any other affinity solid phase capture method, protocols for which are known. In some embodiments, antigen-specific antibody producing cells may be obtained from blood by flow cytometry using the methods described in Wrammert (Nature 2008 453: 667-672), Scheid (Nature 2009 458: 636-640), Tiller (J. Immunol. Methods 2008 329 112-124) or Scheid (Proc. Natl. Acad. Sci. 2008 105: 9727-9732), for example, which are incorporated by reference for disclosure of those methods. Exemplary antibody-producing cell enrichment methods include performing flow cytometry (FACS) of cell populations obtained from a spleen, bone marrow, lymph node or other lymph organs, e.g., through incubating the cells with labeled anti-rabbit IgG and sorting the labeled cells using a FACSVantage SE cell sorter (Becton-Dickinson, San Jose, Calif.). In some embodiments, single or nearly single antibody-producing cells are deposited in microtiter plates. If the FACS system is employed, sorted cells may be deposited after enrichment directly into a microtiter plate.

Enrichment may decrease the size of the cell population by at least 50%, e.g., at least 60%, at least 70%, at least 80%, at least 90%, at least 95% or at least 99% and in certain cases, the plurality of enriched antibody producing cells may be substantially pure, i.e., substantially free of other cells that do not produce an antibody that binds to the antigen, where the term “substantially pure” refers to an isolated population of antibody producing cells, in which cells that express antibodies that specifically bind to the antigen make up at least 5%, 10%, 20%, 30%, at least 40%, at least 50%, at least 60%, at least 70% or more of the total population of cells. The enriched population of antibody producing cells may be employed as a mixture of cells, or alternatively, they may be used as single cells, e.g., by dilution and deposition into individual wells of a microtiter plate.

The enriched population of antibody producing cells may comprise at least 5, at least 10, at least 30, at least 60, at least 100, at least 300, at least 500, at least 1000, at least 5,000, at least 10,000 or at least 100,000, or more antibody producing cells.

The isolated antibody-producing cells may be optionally cultured (i.e. grown in media that supports at least one, at least 5 or at least 10 or more cell divisions of the cell) by methods known to one of skill in the art after they have been deposited (see e.g. WO 01/55216).

In certain embodiments, the antibodies produced by the enriched cells are not well characterized. As such, although the antibody-producing cells are isolated based on the production of antibodies that specifically bind to the antigen, the epitope(s) to which these antibodies bind is unknown, and it is not known if the antibodies have any biological activity (e.g., a neutralizing or blocking activity). Additionally, the nucleic acid sequence or the amino acid sequence of the variable regions of IgH and IgL chains of these antibodies are not known.

Sequences encoding heavy and light chains may be amplified from the cDNA using techniques well known in the art, such as Polymerase Chain Reaction (PCR). See Mullis, U.S. Pat. No. 4,683,195; Mullis et al., U.S. Pat. No. 4,683,195; Polymerase Chain Reaction: Current Communication in Molecular Biology, Cold Springs Harbor Press, Cold Spring Harbor, N.Y., 1989. Briefly, cDNA segments encoding the variable domain of the antibody are exponentially amplified by performing sequential reactions with a DNA polymerase. The reaction is primed by a 5′ primer and a 3′ DNA primer. In some embodiments, the 3′ antisense primer corresponding to a DNA sequence in the constant (or joining) region of the immunoglobulin chain and the 5′ primer (or panel of related primers) corresponding to a DNA sequence in the variable region of the immunoglobulin chain. This combination of oligonucleotide primers has been used in the PCR amplification of murine immunoglobulin cDNAs of unknown sequence (see Sastry et at., Proc Natl. Acad. Sci. 86:5728-5732, 1989 and Orlandi et al., Proc. Natl. Acad. Sci. 86:3833-3837, 1989). Alternatively, an “anchored polymerase chain reaction” may be performed (see Loh et al., Science 243:217-220, 1989). In this procedure, the first strand cDNA is primed with a 3′ DNA primer as above, and a poly(dG tail) is then added to the 3′ end of the strand with terminal deoxynucleotidyl transferase. The product is then amplified by PCR using the specific 3′ DNA primer and another oligonucleotide consisting of a poly(dC) tail attached to a sequence with convenient restriction sites. In many embodiments, however, the entire polynucleotide encoding a heavy or light chain is amplified using primers spanning the start codons and stop codons of both of the immunoglobulin cDNAs, however, depending on the amplification products desired, suitable primers may be used. Exemplary primers for use with rabbit antibody-producing cells are as follows: heavy chain, 5′ end (CACCATGGAGACTGGGCTGCGCTGGCTTCTCCTGGTCGCTGTG; SEQ ID NO:177); heavy chain, 3′ end (CTCCCGCTCTCCGGGTAAATGAGCGCTGTGCCGGCGA; SEQ ID NO:178); light chain kappa, 5′end (CAGGCAGGACCCAGCATGGACACGAGGGCCCCCACT; SEQ ID NO:179); and L kappa, 3′end (TCAATAGGGGTGACTGTTAGAGCGAGACGCCTGC; SEQ ID NO:180). Suitable restriction sites and other tails may be engineered into the amplification oligonucleotides to facilitate cloning and further processing of the amplification products. Amplification procedures using nested primers may also be used, where such nested primers are well known to one of skill in the art. Exemplary methods for amplifying antibody-encoding nucleic acid is also described in Wrammert (Nature 2008 453: 667-672) and Scheid (Nature 2009 458: 636-640), for example. In this embodiment, the enriched cells may be combined before sequencing (in which case the initial amplification product will contain a mixture of a plurality of different products that can be discriminated by cloning the products or using single molecule sequencing technologies), or the cells may be kept separate from one another (in which case the initial amplification product amplified from a single cell may contain a single species that can be sequenced).

In certain embodiments, at least 1,000 heavy chain sequences and at least 1,000 light chain sequences are obtained.

Once the amino acid sequence of heavy and light chains of the antibodies has been obtained, antibodies can be grouped on the basis of sequence similarity to provide a plurality of groups of antibodies that are related by lineage. Methods for performing clonal analysis of antibody sequences are well known and are described in a number of publications including Magori-Cohen (Bioinformatics 2006 22: e332-40), Manske (Clin. Immunol. 2006 120:106-20), Kleinstein (J. Immunol. 2003 171: 4639-49), Clement (Mol. Ecol. 2000 9: 1657-1659), Mehr (J. Immunol. 2004 172 4790-6), Wrammert (Nature 2008 453: 667-672), Scheid (Nature 2009 458: 636-640), which are incorporated by reference herein for disclosure of those methods. The antibodies placed into lineage groups should all be from a single animal, i.e., an individual mouse or rabbit.

In some embodiments, the amino acid positions of an antibody are numbered using a suitable numbering system, such as that provided by Chothia (J Mol Biol 1998; 278: 457-79) or Kabat (1991, Sequences of Proteins of Immunological Interest, DHHS, Washington, DC). CDR and/or framework residues may be identified using these methods. The numbered sequences may be aligned by eye, or by employing an alignment program such as one of the CLUSTAL suite of programs (Thompson et al Nucleic Acids Research, 22:4673-4680). The variable regions of antibodies within a related group of antibodies have amino acid sequences that are very similar. For example, the VH or VL domains of antibodies within a related group of antibodies may have amino acid sequences that are at least about 80% identical (e.g., at least 85% identical, at least 90% identical, at least 95% or at least 98% or at least 99% identical), ignoring any gaps or insertions made to facilitate alignment of the sequences. Antibodies within a related group of antibodies have a VL domains that are similar to each other, as well as VH domains that are similar to each other. In other words, in certain embodiments the VH or VL domains of two different related antibodies usually contain up to about ten (i.e., one, two, three, four or five or more) amino acid differences. An amino acid difference may be present at any position of the variable domain, including in any CDR or in any framework region. Certain related antibodies have H3 CDRs that are almost identical, as well as L3 CDRs that are almost identical. In these embodiments, any two antibodies that are related will have L3 and H3 CDRs that are each identical in length and have near identical sequences (i.e., that contain 0, 1, 2, 3, 4 or 5 amino acid changes). In other words the L3 CDRs of the two antibodies are identical in length and near identical in sequence and the H3 CDRs of the two antibodies are identical in length and near identical in sequence. Two exemplary sets of related antibodies are shown in FIG. 2, and the sequences of 20 exemplary VH3 regions of unrelated rabbit antibodies are shown for comparison in FIG. 3.

In certain embodiments, the heavy chain sequences may or may not be grouped independently of the light chain sequences. If the heavy and light chain sequences are grouped independently of one another, the heavy and light chain groups may be matched up by analysis of lineage trees.

Depending how many sequences are obtained, in certain embodiments the enriched antibodies may be grouped into at least 5 groups, at least 10 groups, at least 20 groups, at least 50 groups, or at least 100 groups or more, e.g., up to 200 or 500 groups or more. Depending how many sequences are obtained, each group may contain from 2 to several hundred or more antibodies.

Once the antibodies have been grouped, a single antibody from each of at least some of the groups (e.g., at least 20%, at least 50 or at least 80% of the groups) is tested in a first bioassay to identify a first antibody that has a biological activity. The bioassay may determine whether the antibody has a biological effect, e.g., an ability to inhibit an interaction between a receptor and an a ligand by either binding to the receptor and blocking binding of the ligand, or by binding to the ligand and neutralizing it, or by promoting or inhibiting a cellular phenotype, e.g., cell growth, cell proliferation, cell migration, cell viability (e.g., apoptosis), cell differentiation, cell adherence, cell shape changes (e.g., tubular cell formation), complement dependant cytotoxicity CDC, antibody-dependent cell-mediated cytotoxicity ADCC, receptor activation, gene expression changes, changes in post-translational modification (e.g., phosphorylatoin), changes in protein targeting (e.g., NFκB localization etc.), etc., or inhibition of receptor multimerization (e.g., dimer or trimerization) or receptor-ligand interactions, etc. Such bioassays are well known in the art. The term “bioassay” is intended to exclude assays in which only the ability of an antibody to bind to a target is read. Bioassays useful in this method are numerous, and include but are not limited to cellular assays in which a cellular phenotype is measured, e.g., gene expression assays; and in vivo assays that involve a particular animal (which, in certain embodiments may be an animal model for a condition related to the target). In certain cases, the assay may be a vascularization assay.

In this embodiment, the antibodies tested in the bioassay may contain naturally paired heavy and light chain variable domains, or non-naturally paired heavy and light chains (i.e., heavy and light chain variable domains from different antibodies of the same lineage group). Since the antibodies are from the same lineage group, it is expected that such antibodies will be functional.

After a first antibody that has a biological activity has been identified, further antibodies that are in the same lineage group as the first antibody are tested in a second bioassay, thereby identifying a second antibody that has the same biological activity as the first antibody. In certain cases at least 10%, at least 20%, at least 50%, or at least 80% of the antibodies in the same lineage group are tested. The first bioassay may be the same as or different to the second bioassay. In certain embodiments, a plurality of antibodies is tested, and the antibody with the best properties is chosen for future use.

In particular embodiments, the further antibodies may contain naturally paired heavy and light chain variable domains, or non-naturally paired heavy and light chain variable domains (i.e., heavy and light chain variable domains from different antibodies of the same lineage group). Since the antibodies are from the same lineage group, it is expected that such antibodies will be functional. In particular embodiments, the pairing of the heavy and light chains may be systematic (e.g., every heavy chain is tested in combination with every light chain) or random (e.g., every heavy chain is tested with randomly selected light chains), for example.

Exemplary VEGF bioassays include assays using isolated protein in a cell free systems, in vitro using cultured cells or in vivo assays. Exemplary VEGF assays include, but are not limited to a receptor tyrosine kinase inhibition assay (see, e.g., Cancer Research Jun. 15, 2006; 66:6025-6032), an in vitro HUVEC proliferation assay (FASEB Journal 2006; 20:2027-2035), an in vivo solid tumor disease assay (U.S. Pat. No. 6,811,779) and an in vivo angiogenesis assay (FASEB Journal 2006; 20:2027-2035). These assays are well known in the art. The descriptions of these assays are hereby incorporated by reference.

Exemplary TNF-α bioassays include in vitro assays using cell free systems or using cultured cells or in vivo assays. As such, TNF-α assays include in vitro human whole blood assay and cell mediated cytotoxicity assay (U.S. Pat. No. 6,090,382), in vitro tumor human killing assay (see, e.g., published U.S. patent application 20040185047), in vivo tumor regression assay (U.S. patent application 20040002589). Additional TNF-α assays are described in a variety of publications, including 20040151722, 20050037008, 20040185047, 20040138427, 20030187231, 20030199679, and Balazovich (Blood 1996 88: 690-696).

A subject antibody inhibits at least one activity of its target in the range of about 20% to 100%, e.g., by at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, usually up to about 70%, up to about 80%, up to about 90% or more. In certain assays, a subject antibody may inhibits its target with an IC₅₀ of 1×10⁻⁷ M or less (e.g., 1×10⁻⁷ M or less, 1×10⁻⁸ M or less, 1×10⁻⁹ M or less, usually to 1×10⁻¹² M or 1×10⁻¹³ M). In assays in which a mouse is employed, a subject antibody may have an ED₅₀ of less then 1 μg/mouse (e.g., 10 ng/mouse to about 1 μg/mouse). In certain embodiments, a subject antibody may be contacted with a cell in the presence of a ligand, and a ligand response phenotype of the cell is monitored.

In certain embodiments, particularly if the antigen elicits a strong response in the animal, the method may be practiced in the absence of any antigen-based enrichment of antibody producing cells prior to the first bioassay. In these embodiments, the method may involve: a) obtaining the antibody heavy chain sequences and the antibody light chain sequences from a population of B cells of an animal, wherein said population of B cells is not enriched for B cells that produce antibodies that specifically bind to a target antigen, b) grouping the heavy and light chain sequences on the basis of sequence similarity to provide a plurality of groups of antibodies that are related by lineage; c) testing a single antibody from each of the groups in a first bioassay to identify a first antibody that has a biological activity; and, after the first antibody has been identified; and d) testing further antibodies that are in the same group as the first antibody in a second bioassay, thereby identifying a second antibody that has the biological activity.

Another embodiment of is illustrated in FIG. 1B. With reference to FIG. 1B, this embodiment of the subject method may involve immunizing an antibody-producing animal with a selected antigen, and testing a plurality of antibodies produced by a first portion of an antibody producing organ of the animal (e.g., a first portion of the spleen, a first portion of the lymph nodes, a first portion of bone marrow, or a first portion of the peripheral blood mononuclear cell (PBMC) population in the bloodstream of the animal, etc.) in a bioassay to identify a first antibody that has a biological activity. In this embodiment, the first and second portions of an antibody-producing organ need not be spatially separated in the organ. Rather, since a first portion or an organ can be made by, for example, making a single cell suspension of the organ and then removing part of the suspension, the first and second portions of an organ may be interspersed with one another in the organ. In the example shown in FIG. 1B, antibody A is identified as having a biological activity. The nucleotide sequence encoding the IgH and IgL chain of the antibody is obtained. Based on these sequences, PCR primers that are specific for the heavy and light chains of antibodies that are in the same lineage group as the identified antibody are designed, and used to obtain from the second portion of the antibody producing organ the sequences of further antibodies that are in the same lineage group as the identified antibody. The further antibodies are tested, and a second antibody from the same lineage group and also having the same biological activity as the first antibody is identified.

Many exemplary aspects of this alternative method, e.g., which antigens and bioassays can be employed in the method, etc., are discussed above. In certain embodiments, a lead antibody obtained from a first portion of an antibody-producing organ is identified using a bioassay. In this embodiment, the antibodies obtained from the first portion of the organ are screened using a hybridoma-based method or by a method that does not require production of hybridomas, e.g., by phage display or by the method described in US20040067496 and other references, for example, to identify a biologically active antibody. In one embodiment, a portion of the splenocytes of a spleen of a single animal is fused with a fusion partner to produce hybridomas that are then screened to identify a biologically active antibody. In another embodiment, heavy and light chain sequences are directly amplified from PBMCs, and recombinant antibodies are expressed in a different cell (e.g., as described in US20040067496) prior to screening. In another embodiment, a phage display library is constructed from the RNA made from a portion of the spleen of an animal, and the phage display library is screened. The first, biologically active antibody is identified, and the nucleic acid encoding that antibody is sequenced.

In certain embodiments, polynucleotides encoding the variable heavy and variable light domains of lineage-related antibodies may be amplified from the same animal as the first antibody by “CDR-anchored PCR”, i.e., using pairs of primers that each contains a primer that is complementary to a CDR-encoding region of the parent antibody cDNA. In these embodiments, the method may include: a) obtaining the nucleotide sequences of: i. a heavy chain-encoding nucleic acid that encodes the variable heavy chain of a first antibody of an immunized animal; and ii. a variable light chain-encoding nucleic acid that encodes the light chain of the first antibody; b) obtaining the amino acid sequence of the variable domains of the heavy and light chains of further antibodies from the immunized animal, using: i. a first primer pair that includes a first primer that is complementary to a CDR-encoding region of the heavy chain-encoding nucleic acid; and ii. a second primer pair that includes a second primer that is complementary to a CDR-encoding region of the light chain-encoding nucleic acid. After the amino acid sequences of the variable domains of the further antibodies have been determined by translation of the obtained nucleotide sequences, the amino acid may be analyzed using the above methods to confirm that they are related by lineage to the first antibody (e.g., analyzed to determine whether the amino acid sequences of the heavy and light chains are at least 80% identical to those of the parent antibody and whether the heavy and light chain CDR3 regions are of identical length of near identical sequence etc. as discussed above).

As would be readily apparent, a variety of techniques are available for amplifying sequences that encode further antibodies from an animal after the nucleotide sequence encoding a first antibody has been obtained from that animal. For example, sequences encoding the heavy and light chains of the second antibody may be amplified using inverse PCR (e.g., using two primers that face away from each other) or by anchored PCR using a specific (where a specific primer may be complementary to a different sequence of the first antibody, e.g., a different CDR sequence) or “universal” primer (where a universal primer is complementary to a sequence that is present in a plurality of different antibody-encoding polynucleotides), where one of the primers is complementary to first CDR-encoding region using cDNA as a template. In certain cases, a universal primer may be complementary to a sequence that is in at least 10% (e.g., at least 20% at least 40% at least 50% or at least 80%) of all heavy or light chain encoding cDNAs obtainable from the animal (e.g., complementary to nucleic acid encoding a conserved sequence that is present in the constant region or secretion signal of the antibodies). In other embodiments, the universal primer may be complementary to flanking sequences in the vector into which cDNA from the animal is cloned or to linkers ligated onto the cDNA, for example.

In one embodiment, two amplification reactions are performed using cDNA as a template, where the first reaction amplifies the heavy chain variable domain-encoding nucleic acid for the second antibody and the second reaction amplifies the light chain variable domain-encoding nucleic acid for the second antibody. In this embodiment: a) the first reaction uses: i. a CDR-specific primer that is complementary to a CDR-encoding region (i.e., the CDR1, CDR2 or CDR3 region) of the heavy chain-encoding nucleic acid of the first antibody and ii. a universal second primer that is complementary to a non-variable domain-encoding region of the antibody heavy chain cDNA, e.g., to a sequence that encodes the constant domain or secretion signal of the heavy chain of the first antibody, as illustrated in the examples section of this disclosure; and b) the second reaction uses i. a CDR-specific primer that is complementary to a CDR-encoding region (i.e., the CDR1, CDR2 or CDR3 region) of the light chain-encoding nucleic acid of the first antibody and ii. a universal second primer that is complementary to a non-variable domain-encoding region of the antibody light chain cDNA, e.g., to a sequence that encodes the constant domain or secretion signal of the light chain of the first antibody, as illustrated in the examples section of this disclosure.

Several strategies for cloning antibody sequences by PCR are known and may be readily adapted for use in the instant method (e.g., by using a CDR-specific primer in addition to a disclosed primer). Such strategies include those described by: LeBoeuf (Cloning and sequencing of immunoglobulin variable-region genes using degenerate oligodeoxyribonucleotides and polymerase chain reaction. Gene. 1989 82:371-7), Dattamajumdar (Rapid cloning of any rearranged mouse immunoglobulin variable genes Immunogenetics. 1996 43:141-51), Kettleborough (Optimization of primers for cloning libraries of mouse immunoglobulin genes using the polymerase chain reaction Eur. J. Immunol. 1993 23:206-11), Babcook (A novel strategy for generating monoclonal antibodies from single, isolated lymphocytes producing antibodies of defined specificities Proc. Natl. Acad. Sci. 1996 93: 7843-7848) and Williams (Structural diversity in domains of the immunoglobulin superfamily. Cold Spring Harb. Symp. Quant. Biol. 1989 54:637-47) as well as many others. In certain cases, the second primer may be a mixture of different primers or degenerate primers, for example.

The heavy chain CDR-specific primer may be complementary to the sequence that encodes the CDR1, CDR2 or CDR3 region of the heavy chain of the first antibody and, likewise, the light chain CDR-specific primer may be complementary to the sequence that encodes the CDR1, CDR2 or CDR3 region of the light chain of the first antibody. In certain embodiments, a particular CDR-specific primer may be chosen because the CDR sequence to which it binds may be known to be less variable than other CDR sequences.

Such CDR-anchored amplification method described in U.S. patent application Ser. No. 61/151,052, filed Feb. 9, 2009, which is incorporated by reference in its entirety for disclosure of those methods.

The above-described CDR-anchored method is effective because most sequence diversity between the variable domains in different families of antibodies that are related by lineage is in the CDR regions (i.e., the CDRs are quite variable between different families of antibodies), whereas the sequence of the CDR regions is relatively constant within the antibodies of a single family of antibodies that are related by lineage. Because the method uses primers that are complementary to sequence that are highly variable between different families of related antibodies, only related antibodies should be successfully amplified by the method.

In this embodiment, an amplification reaction may be performed using cDNA made from a second portion of the antibody-producing organ. For example, the amplification reaction may be done using nucleic acid obtained from single cells (or cultures of the same) or nucleic acid obtained from pooled cells (e.g., pools of different antibody-producing cells that each contain cDNA). Pools may contain cDNA from at least 10, at least 100 or at least 1,000 different antibody cells, for example. In embodiments in which hybridomas are used, the identity of the hybridomas that contributed to each pool may be tracked in order to identify a hybridoma producing a second antibody if the sequence encoding the second antibody is successfully amplified. Amplification products of the expected size may be sequenced directly or cloned and sequenced using known methods.

Depending on the antigen and number of antibody-producing cells in the second portion of the antibody-producing organ, the heavy and light chain variable sequences for at least 5, at least 10, at least 20, at least 50 or at least 100 or more, e.g., up to 200, up to 500, 1,000, 5,000 or 10,000 or more sequences may be obtained.

The further antibodies are tested in a second bioassay to identify a second antibody that has the same biological activity as the first antibody. As noted above, the first and second bioassays may be the same or different. In certain cases at least 30% (e.g., at least 70%, at least 80%, or at least 90%) of the lineage-related antibodies are tested in the bioassay. In this embodiment, the further antibodies may contain naturally paired heavy and light chain variable domains, or non-naturally paired heavy and light chains (i.e., heavy and light chain variable domains from different antibodies of the same lineage group). Since the antibodies are from the same lineage group, it is expected that such antibodies will be functional. In particular embodiments, the pairing of the heavy and light chains may be systematic (e.g., every heavy chain is tested in combination with every light chain) or random (e.g., every heavy chain is tested with randomly selected light chains), for example.

An antibody produced by the instant methods finds use in diagnostics, in antibody imaging, and in treating diseases treatable by monoclonal antibody-based therapy. In particular, an antibody humanized by the instant methods may be used for passive immunization or the removal of unwanted cells or antigens, such as by complement mediated lysis or antibody mediated cytotoxicity (ADCC), all without substantial immune reactions (e.g., anaphylactic shock) associated with many prior antibodies. For example, the antibodies of the present invention may be used as a treatment for a disease where the surface of an unwanted cell specifically expresses a protein recognized the antibody (e.g. HER2, or any other cancer-specific marker) or the antibodies may be used to neutralize an undesirable toxin, irritant or pathogen. Humanized antibodies are particularly useful for the treatment of many types of cancer, for example colon cancer, lung cancer, breast cancer prostate cancer, etc., where the cancers are associated with expression of a particular cellular marker. Since most, if not all, disease-related cells and pathogens have molecular markers that are potential targets for antibodies, many diseases are potential indications for humanized antibodies. These include autoimmune diseases where a particular type of immune cells attack self-antigens, such as insulin-dependent diabetes mellitus, systemic lupus erythematosus, pernicious anemia, allergy and rheumatoid arthritis; transplantation related immune activation, such as graft rejection and graft-vs-host disease; other immune system diseases such as septic shock; infectious diseases, such as viral infection or bacteria infection; cardiovascular diseases such as thrombosis and neurological diseases such as Alzeimer's disease.

An antibody of particular interest is one that modulates, i.e., reduces or increases a symptom of the animal model disease or condition by at least about 10%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 45%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 80%, at least about 90%, or more, when compared to a control in the absence of the antibody. In general, a monoclonal antibody of interest will cause a subject animal to be more similar to an equivalent animal that is not suffering from the disease or condition. Monoclonal antibodies that have therapeutic value that have been identified using the methods and compositions of the invention are termed “therapeutic” antibodies.

EXAMPLES

The following examples are provided in order to demonstrate and further illustrate certain embodiments and aspects of the present invention and are not to be construed as limiting the scope thereof.

Example 1 Method of Producing a Library of Engineered-Antibody Producing Cells

Isolation of Antibody Producing Cells

Rabbits are immunized with an antigen using a standard immunization protocol. At about 10 days after the second booster immunization, antibody titers are determined using ELISA. Two booster immunizations are usually sufficient for obtaining high antibody titers. As soon as a high titer (detectable signal at 1:100000 dilution) is observed, the rabbit is sacrificed and bone marrow cells are collected from the femur and/or other large bones. Spleen cells and peripheral blood mononuclear cells (PBMCs) are also collected and frozen in 10% DMSO/90% FBS for analysis at a later time. Very large numbers of bone marrow cells (>2 billion) are obtained from a single rabbit. After washing, clearing of debris, and red-cell lysis, the antibody producing cells, which bind to the antigen with which the rabbit was immunized, are purified using FACS. Briefly, the antigen is conjugated to a fluorescent dye and the labeled antigen is incubated with the cells obtained above. The cells are briefly rinsed to wash off any antigen non-specifically attached to the cell. After rinsing, fluorescent cells are separated from unlabeled cells using FACS. These fluorescent cells express antibodies on their surface that specifically binds to the antigen with which the animal was immunized.

RT-PCR to Obtain IgH and IgL Chain cDNA

Primer Design:

In rabbit, the 5′ coding sequences of rabbit immunoglobulin heavy chain are primarily derived from only one gene. Antibody diversity is created by gene conversion and somatic mutation, but this does not affect the 5′ end of the antibody cDNA. Thus, most rabbit IgG H chains have very similar or identical signal peptide sequences, and the same is true for L chains. On the 3′ side, primers hybridizing to the constant domains, which also have identical sequences in most rabbit antibodies (rabbit constant domains are not divided into subclasses). As a result, only one pair of primers each is required for amplifying the vast majority of rabbit IgG H and L sequences. Typical priming sites are shown below, although any primer sites are used so long as the a variable domain-encoding polynucleotide is amplified. Typical primers for use with rabbit antibody-producing cells are as follows: heavy chain, 5′ end (CACCATGGAGACTGGGCTGCGCTGGCTTCTCCTGGTCGCTGTG (SEQ ID NO: 181)); heavy chain, 3′ end (CTCCCGCTCTCCGGGTAAATGAGCGCTGTGCCGGCGA (SEQ ID NO: 182)); light chain kappa, 5′end (CAGGCAGGACCCAGCATGGACACGAGGGCCCCCACT (SEQ ID NO: 183)); and L kappa, 3′end (TCAATAGGGGTGACTGTTAGAGCGAGACGCCTGC (SEQ ID NO: 184)).

Note that the 3′ H chain primer spans the 3′ end of the coding region, the stop codon, and the beginning of the 3′ UTR. Thus, this primer is specific for the secreted form of IgG, and does not recognize the transmembrane form, which does not contain this sequence due to alternative splicing. Therefore, the method is unlikely to recover IgG from memory B cells, which express predominantly the transmembrane form.

RT-PCR Conditions:

Cell lysis is done heating in a buffer containing RNAse inhibitors, followed by DNA degradation and reverse transcription performed at high temperature (60° C.) using a thermostable reverse transcriptase. Reverse transcription is primed by primers specific for the 3′ region of the IgG mRNAs. A single-step RT-PCR protocol is used, utilizing a thermostable enzyme that has both reverse transcriptase and DNA polymerase activities (MasterAmp™ RT-PCR Kit for High Sensitivity, Epicentre Technologies, Madison, Wis.). PCR products are analyzed by agarose gel electrophoresis. If required, a second round of PCR is performed with nested primers. In some PCR applications, this step is required to produce sufficient amounts of specific product.

Co-Amplification of H and L Chain cDNAs:

Different combinations of primers are tried, to accomplish efficient PCR amplification of H and L chain cDNAs in the same reaction. A ‘head start’ approach is often used, where PCR cycling is started with H chain primers alone; after a number of cycles (5 to 10) the L chain primers are added to the mix. Using these methods, similar yields of H and L chain are produced. Alternatively, a nested PCR approach is used for the H chain, by performing an initial round of PCR with primers amplifying the full-length cDNA, and a second round with primers amplifying only the vH-cH1-hinge portion of the H chain. This method should yield a product similar in size to the L chain cDNA. Expression of this product yields the F(ab′)2 fragment of IgG, which is divalent and fully active for antigen-binding.

IgG heavy and light chain PCR products are joined with CMV promoter and BGH3′pA (bovine growth hormone polyadenylation/transcription termination) sequences.

Method a) Overlap Extension PCR.

CMV Promoter Segment:

To prepare the CMV promoter fragment, the expression vector pcDNA-3 (which contains the CMV promoter and BGH3′pA segments) is used as a template, and the following PCR setup:

Primer 1 (5′ AATTCACATTGATTATTGAG 3′; SEQ ID NO: 185) corresponding to the 5′ end of the CMV promoter;

Primer 2 (5′ CAGCGCAGCCCAGTCTCCATCCCGTAAGCAGTGGGTTCTC 3′; SEQ ID NO: 186) corresponding to the 3′ end of the CMV promoter, and containing a 5′ extension (underlined) complementary to the 5′ end of the rabbit Ig H signal peptide sequence is performed.

PCR amplification with these primers produces a linear DNA fragment consisting of the CMV promoter (610 nt) and a 20 nt extension on the 3′ end, which is complementary to the 5′ end of the IgG vH coding region. As mentioned above, most rabbit IgGs contain 5′ vH (signal peptide) regions with nearly identical sequences. Therefore, only one primer pair is needed to amplify the majority of rabbit IgG cDNAs.

BGH3′ pA Segment.

A similar approach is used to prepare the BGH3′pA segment. Again, the pcDNA3 expression vector is used as a template, and the following primers are used:

Primer 3 (5′ CCGGGTAAATGAGCGCTGTGGTTTAAACCCGCTGATCAGC 3′; SEQ ID NO: 187), corresponding to 5′ end of the BGH3′pA domain extended by a 20 nt sequence complementary to the 3′ end of the IgG heavy chain coding region, and including 11 nt of the 3′ untranslated domain.

Primer 4 (5′ AAGCCATAGAGCCGACCGCA 3′; SEQ ID NO: 188) corresponding to the 3′ end of the BGH polyadenylation domain.

PCR amplification results in a 250 nt fragment containing the BGH3′pA sequence and a 20 nt extension that overlaps with the 3′ end of the IgG heavy chain sequence.

Overlap Extension PCR:

The IgG heavy chain PCR product are mixed with the CMV promoter and BGH3′pA segments. The mixture is subjected to 10 cycles of PCR. The overlapping segments anneal, followed by extension of the overlapping 3′ ends. At the end of the 10 cycles, the outside primers (primers 1 and 4) are added to the mixture, and another 30 cycles of PCR are performed. The product is a 2100 nt fragment consisting of the CMV promoter, the IgG H coding sequence, and the BGH terminator.

IgG Light Chain:

The process are carried out in an analogous manner to produce 1500 nt fragments consisting of CMV promoter, kappa light chain coding sequence, and BGH terminator. A separate set of primers for lambda light chains can also be used to amplify and clone lambda light chains.

A low concentration of primers in the initial PCR reaction may be used. In some embodiments, primers are be designed such that amplification of the heavy chain results in a nucleotide encoding a form of the IgG H chain that is truncated at the 3′ end of the hinge domain. This fragment would be similar in size to the v kappa light chain. Co-expression of these fragments results in the secretion of F (ab′)₂ fragments of IgG.

Method b) Topoisomerase I Coupling.

This method is used as an alternative to overlap extension PCR. The overall experimental strategy is as described above. Commercially available topoisomerase-modified CMV promoter and BGH3′pA segments will be used (Invitrogen, San Diego, Calif.). The CMV promoter element (610 nt) is provided in a modified form with the topoisomerase recognition site (CCCTT) at its 3′ end, and a six base pair single-stranded overhang at the 3′ end (GCCTTG) which is used for directional coupling with the PCR product. The topoisomerase I enzyme is bound to the recognition site CCCTT. In order to be joined to the Topo-modified CMV promoter, the PCR product needs to contain the sequence CGGAACAAGGG (SEQ ID NO: 189) at its 5′end. This sequence is cleaved by topoisomerase, resulting in a 6-base single-strand overhang that is complementary to the single-strand overhang of the CMV promoter element. These overhangs anneal and the fragments are covalently joined by the enzyme.

In order to link the IgG cDNA fragment to the CMV promoter, the 5′ primer used in the last round of IgG amplification are extended at its 5′ end with the sequence CGGAACAAGGG (SEQ ID NO: 190).

The linkage of the 3′ end of the IgG fragment with the BGH3′pA element is performed in an analogous manner, except that a different single-stranded overhang (GACTCA) is being used. This provides for directionality and selective joining of the 5′ end with the CMV promoter and the 3′ with the BGH terminator.

The joining reaction is carried out by mixing the 5′ CMV element, IgG PCR product, and 3′BGH element at a 1:2:1 ratio, and adding the 10× reaction buffer. The reaction proceeds rapidly and is usually complete within 10 min at room temperature. Following the reaction, a secondary PCR reaction is carried out, using primers corresponding to the 5′ end of the CMV promoter and the 3′ end of the BGH terminator (primers 1 and 4, see above). This results in the formation of the 2.1 kb IgG H expression cassette, or the 1.5 kb IgG L expression cassette. Conditions for co-production of H and L IgG expression cassettes in the same reaction are also envisioned.

The IgG H expression cassettes are cloned into a vector carrying a hygromycin resistance marker to generate an IgG H expression cassette library. The IgG L expression cassettes are cloned into a vector carrying a G418 resistance marker to generate an IgG L expression cassette library.

Equimolar amounts of the IgG H and IgG L expression cassette libraries are mixed and transfected into CHO cells. The transfected CHO cells are plated into 96-well or 384-well microtiter plates such that each well contains approximately one cell. Cells are maintained in media containing both hygromycin and G418. Cells that survive the double selection contain at least one expression cassette pair.

These cells are cultured and the antibodies produced by these cells are tested for binding to the antigen with which the rabbit was immunized.

Example 2 Related Antibodies

Antibodies were obtained from rabbit hybridoma cells producing anti-KDR antibodies that block the interaction of VEGF with its receptor (KDR). The hybridoma cells were generated by fusing immunized rabbit splenocytes with the rabbit hybridoma fusion partner 240E-W2.

New Zealand white rabbits were immunized with a fusion protein containing the rabbit Fc region and the extracellular domain of KDR. Each rabbit received a primary immunization by subcutaneous injection of 0.4 mg of the purified protein with complete Freund's or TiterMax adjuvant. The animals were then boosted by subcutaneous injection of 0.2 mg of the protein with incomplete Freund's or TiterMax once every three weeks. The final boost (0.4 mg protein in saline) was given intravenously 4 days before splenectomy.

Cell fusions were performed following the conventional protocol of Spieker-Polet using PEG. The ratio of splenocytes to the fusion partner was 2:1. The fused cells were plated in 96-well plates and HAT was added after 48 hrs to select for hybridomas. Direct ELISA was performed to identify antibodies that block binding of VEGF to a KDR fusion protein coated onto a microtiter plate. In this assay, the Fc-KDR ECD fusion protein was coated onto a 96-well ELISA plate and goat anti-rabbit IgG FEB conjugated to alkaline phosphatase was used to detect antibody binding to KDR. Antibodies identified in this assay were then were screened for blocking VEGF interaction with KDR in a ligand-receptor assay. The blocking antibodies were identified by their inhibition of binding of VEGF in solution to KDR coated on plates.

cDNAs coding the heavy and light chains of the antibodies were cloned and sequenced. The polypeptides encoded by the cDNAs were aligned and this alignment is shown in FIG. 2. FIG. 2 shows that two groups of related anti-KDR rabbit monoclonal Abs were obtained. Antibodies 69, 6, 71, 43, 81, 4, 30, 54, 57, 50, 68, 56, 83, 36, 77, 95, 14, 42, 27 belong to one group. Antibodies 2, 17, 3, 6, 9 belong to a different group.

FIG. 3 is a multiple sequence alignment of the H3 region of ten rabbit antibody sequences extracted from the Kabat database to illustrate the expected variation in unrelated antibodies.

Example 3 CDR-Anchored Amplification of Polynucleotides Encoding Related Antibodies

Several examples illustrating a method by which the amino acid sequences of related rabbit antibodies may be obtained by PCR are set forth in FIGS. 4A-4H. In the examples shown in FIGS. 4A-4D, reverse primers that are complementary to the CDR3 regions of the light chain of antibodies 31 (FIG. 4A), 29 (FIG. 4 b), 27 (FIG. 4 c) and 20 (FIG. 4 d) were designed and can be used along with a universal forward primer (SEQ ID NO: 118) that binds to a site that is present in all rabbit antibody heavy chain sequences to amplify coding sequences for related antibodies. In the example shown in FIG. 4A, the primers designed against sequences that encode antibody 31 are expected to amplify light chain variable domain sequences for antibodies 11, 12, 2, 25, 22, 27, 3, 1, 19, 24, 23, 18, 13, 10 and 21, which are all from the same animal as antibody 31 and are related to antibody 31 by lineage. In the example shown in FIG. 4B, the primers designed against sequences that encode antibody 29 are expected to amplify light chain variable domain sequences for antibodies 8, 9, 16 and 32, which are all from the same animal as antibody 29 and are related to antibody 29 by lineage. In the example shown in FIG. 4C, the primers designed against sequences that encode antibody 27 are expected to amplify light chain variable domain sequences for other antibodies which are all from the same animal as antibody 27 and are related to antibody 27 by lineage. In the example shown in FIG. 4D, the primers designed against sequences that encode antibody 20 are expected to amplify light chain variable domain sequences for other antibodies which are all from the same animal as antibody 20 and are related to antibody 20 by lineage.

In the examples shown in FIGS. 4E-4H, reverse primers that are complementary to the CDR3 regions of the heavy chain of antibodies 31 (FIG. 4E), 29 (FIG. 4F), 29 (FIG. 4G) and 21 (FIG. 4H) were designed and can be used along with a universal forward primer (SEQ ID NO: 148) that binds to a site that is present in all rabbit antibody heavy chain sequences to amplify coding sequences for related antibodies. In the example shown in FIG. 4E, the primers designed against sequences that encode antibody 31 are expected to amplify heavy chain variable domain sequences for antibodies 2, 17, 22, 25, 12, 1, 24, 19, 25, 11, 31, 3, 10, 13, 21, 18 and 23, which are all from the same animal as antibody 31 and are related to antibody 31 by lineage. In the example shown in FIG. 4F, the primers designed against sequences that encode antibody 29 are expected to amplify heavy chain variable domain sequences for antibodies 8, 9, 16 and 32, which are all from the same animal as antibody 29 and are related to antibody 29 by lineage. In the example shown in FIG. 4G, the primers designed against sequences that encode antibody 27 are expected to amplify heavy chain variable domain sequences for other antibodies which are all from the same animal as antibody 27 and are related to antibody 27 by lineage. In the example shown in FIG. 4H, the primers designed against sequences that encode antibody 20 are expected to amplify heavy chain variable domain sequences for other antibodies which are all from the same animal as antibody 20 and are related to antibody 20 by lineage. 

What is claimed is:
 1. A method for identifying an antibody that binds to an antigen, comprising: a) obtaining: (i) antibody heavy chain variable domain sequences and (ii) antibody light chain variable domain sequences by sequencing heavy and light chain-encoding cDNAs made from a population of antibody-producing B cells of an animal that has been immunized by the antigen; b) grouping the heavy chain variable domain sequences of step a) into a plurality of heavy chain groups and, independently, grouping the light chain variable domain sequences of step a) into a plurality of light chain groups, wherein: (i) heavy chain sequences that comprise CDR3 regions that are of the same length and contain up to five amino acid substitutions relative to one another are grouped together; and (ii) light chain sequences that comprise CDR3 regions that are of the same length and contain up to five amino acid substitutions relative to one another are grouped together; c) selecting a heavy chain group and a light chain group from the groups produced by step b); d) pairing a heavy chain variable domain sequence and a light chain variable domain sequence from the groups selected in c); and e) testing an antibody comprising the heavy and light chain sequences paired in d) for binding to the antigen.
 2. The method of claim 1, wherein the heavy and light chain variable domain sequences paired in step d) are not naturally paired together in the animal of step a).
 3. The method of claim 1, wherein: (i) heavy chain sequences that comprise CDR3 regions that are of the same length and contain 0, 1 or 2 amino acid substitutions relative to one another are grouped together; and (ii) light chain sequences that comprise CDR3 regions that are of the same length and contain 0, 1 or 2 amino acid substitutions relative to one another are grouped together.
 4. The method of claim 1, wherein the grouping step (b) further comprises comparing the entire lengths of the heavy and light chain variable domain sequences to one another.
 5. The method of claim 1, wherein the obtaining step a) is done by sequencing cDNAs encoding the heavy and light chain variable domain sequences from single B cells, or cultures of the same.
 6. The method of claim 1, wherein the obtaining step a) is done by sequencing cDNAs encoding the heavy and light chain variable domain sequences made from a plurality of pools of B cells.
 7. The method of claim 1, wherein the heavy and light chain groups of selecting step c) are selected by analysis of lineage trees.
 8. The method of claim 1, wherein, prior to step a), the antibody-producing B cells are enriched by binding to the antigen.
 9. The method of claim 1, wherein, prior to step a), the antibody-producing B cells are not enriched by binding to the antigen.
 10. The method of claim 1, wherein the selected heavy chain and light chain groups selected in step c) both contain at least 10 members.
 11. The method of claim 1, wherein the testing step e) comprises testing the antibody of e) in a blocking assay and/or a neutralization assay.
 12. The method of claim 1, wherein the testing step e) comprises testing the antibody of e) in an ELISA.
 13. The method of claim 1, wherein the obtaining step a) comprises obtaining at least 1,000 different antibody heavy chain variable domain sequences and at least 1,000 different antibody light chain variable domain sequences.
 14. The method of claim 1, wherein the antibody-producing B cells are cells obtained from spleen, lymph node or peripheral blood.
 15. The method of claim 1, wherein the animal is a rabbit.
 16. A method for identifying a monoclonal antibody comprising: a) obtaining a population of antibody-producing B cells of an animal that has been immunized by an antigen; b) making pools of the antibody-producing B cells obtained in step a) to produce a plurality of pools; c) sequencing antibody heavy chain and antibody light chain-encoding cDNAs from each of the pools of step b); d) grouping the heavy chain variable domain sequences of step c) into a plurality of heavy chain groups and, independently, grouping the light chain variable domain sequences of step c) into a plurality of light chain groups, wherein: (i) heavy chain sequences that comprise CDR3 regions that are of the same length and contain up to five amino acid substitutions relative to one another are grouped together; and (ii) light chain sequences that comprise CDR3 regions that are of the same length and contain up to five amino acid substitutions relative to one another are grouped together; e) selecting a heavy chain group and a light chain group from the groups produced by step d); f) pairing a heavy chain variable domain sequence and a light chain variable domain sequence from the groups selected in step e); and g) testing an antibody comprising the heavy and light chain sequences paired in step f) for binding to the antigen.
 17. The method of claim 16, wherein the pools comprise at least 1,000 different antibody-producing B cells.
 18. The method of claim 16, wherein the sequencing step c) provides at least 10,000 different heavy chain variable domain sequences and at least 10,000 different light chain variable domain sequences.
 19. The method of claim 16, wherein, in step d): (i) heavy chain sequences that are grouped together comprise CDR3 regions that are of the same length and contain 0, 1, 2 or 3 amino acid substitutions relative to one another; and (ii) light chain sequences that are grouped together comprise CDR3 regions that are of the same length and contain 0, 1, 2 or 3 amino acid substitutions relative to one another.
 20. The method of claim 16, wherein the grouping step d) further comprises comparing the entire lengths of the heavy and light chain variable domain sequences to one another.
 21. The method of claim 16, wherein the heavy and light chain groups selected in step e) are selected by analysis of lineage trees.
 22. The method of claim 16, wherein, prior to step a), the B cells are enriched by binding to the antigen.
 23. The method of claim 16, wherein, prior to step a), the B cells are not enriched by binding to the antigen.
 24. The method of claim 16, wherein the grouping of step a) results in at least 10 groups of heavy chain sequences and at least 10 groups of light chain sequences.
 25. The method of claim 16, wherein the B cells are cells obtained from spleen, lymph node or peripheral blood.
 26. The method of claim 16, wherein the animal is a rabbit. 